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Abstract 

This paper formulates and studies a general distributed field reconstruction problem using a dense 
network of noisy one-bit randomized scalar quantizers in the presence of additive observation noise of 
unknown distribution. A constructive quantization, coding, and field reconstruction scheme is developed 
and an upper-bound to the associated mean squared error (MSE) at any point and any snapshot is derived 
in terms of the local spatio-temporal smoothness properties of the underlying field. It is shown that when 
the noise, sensor placement pattern, and the sensor schedule satisfy certain weak technical requirements, 
it is possible to drive the MSE to zero with increasing sensor density at points of field continuity while 
ensuring that the per-sensor bitrate and sensing-related network overhead rate simultaneously go to zero. 
The proposed scheme achieves the order-optimal MSE versus sensor density scaling behavior for the 
class of spatially constant spatio-temporal fields. 

I. Introduction and Overview 

We study the problem of reconstructing, at a data fusion center, a temporal sequence of spatial 
data fields, in a bounded geographical region of interest, from finite bit-rate messages generated by 
a dense noncooperative network of sensors. The data-gathering sensor network is made up of noisy 
low-resolution sensors at known locations that are statistically identical (exchangeable) with respect to 
the sensing operation. The exchangeability assumption reflects the property of an unsorted collection of 
inexpensive mass-produced sensors that behave in a statistically identical fashion. We view each data 
field as an unknown deterministic function over the geographical space of interest and make only the 
weak assumption that they have a known bounded maximum dynamic range. The sensor observations 
are corrupted by bounded, zero-mean, additive noise which is independent across sensors with arbitrary 

* Y. Wang, P. Ishwar, and V. Saligrama are with the Department of Electrical and Computer Engineering, Boston University, 
Boston, MA 02215. Email: {yw, pi , srv}@bu . edu. 



1 



dependencies across field snapshots. This noise has an arbitrary, unknown distribution but a known 
maximum dynamic range. The sensors are equipped with binary analog-to-digital converters (ADCs) 
in the form of comparators with random thresholds which are uniformly distributed over the (known) 
sensor dynamic range. These thresholds are assumed to be independent across sensors with arbitrary 
dependencies across snapshots. These modeling assumptions partially account for certain real-world 
scenarios that include (i) the unavailability of good initial statistical models for data fields in yet to be 
well studied natural phenomena, (ii) unknown additive sensing/observation noise sources, (iii) additive 
model perturbation errors, (iv) substantial variation of preset comparator thresholds accompanying the 
mass-manufacture of low-precision sensors, (v) significant temperature fluctuations across snapshots 
affecting hardware characteristics, and (vi) the use of intentional dither signals for randomized scalar 
quantization. 

Building upon prior results in [1], [2], and [3], we develop a simple coding and field reconstruction 
scheme based on one-bit scalar quantized samples of noisy observations. We characterize the associated 
scaling behavior of the MSE of field reconstruction with sensor density in terms of the local and 
global moduli of continuity of the underlying sequence of fields. This MSE characterization is for 
fixed, positive, and equal sensor coding rates (bits per sensor per snapshot). These achievable results 
reveal that for bounded, zero-mean, additive observation noise of unknown distribution, the MSE at 
every point of continuity of every field snapshot can be made to go to zero as sensor density increases 
while simultaneously sending the per-sensor bitrate and any sensing-related network rate overheads (e.g., 
sensor addresses) to zero. This is possible if the sensor placement and sampling schedule satisfy a certain 
uniformity property. This property ensures that the field estimate at any given spatial location is formed 
using the observations from increasingly many sensors that are located within a vanishingly smaller 
neighborhood of the location. 

The MSE results of this work pertain to uniform pointwise convergence to zero, that is, for every 
spatial location of every field, unlike results pertaining to spatially and temporally averaged MSE which 
are more commonly encountered. The rate of decay of field reconstruction MSE at a given location 
is related to the local modulus of continuity of the field at the given location and time. Specializing 
these results to the case of spatially constant fields yields an achievable MSE decay rate of 0{1/N) 
where N is the sensor network sizellj A Cramer-Rao lower-bound on the MSE for parameter estimation 

'Landau's asymptotic notation: /(TV) = 0(g(N)) O lim sup^^ \f(N)/g(N)\ < oo; f(N) = tt(g(N)) g(N) = 
0(f(N)); f(N) = @(g(N)) O f(N) = 0(g(N)) and g(N) = 0(f(N)). 
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establishes that the 0(1/N) MSE scaling behavior is order-optimal in a minimax sense. Since in our 
problem formulation, the per-sensor bitrate is held fixed and equal across sensors, in a scaling sense, the 
MSE decreases inversely with the total network rate. 

Previous estimation-theoretic studies of one-bit distributed field reconstruction have focused on recon- 
structing a single field snapshot and have either (i) assumed zero observation noise [1], [2], or (ii) assumed 
a spatially constant field (equivalent to scalar parameter estimation) with a one-bit communication as 
opposed to a one-bit sensing constraint [3]. The system proposed in this work integrates the desirable 
field sensing and reconstruction properties of these apparently different one-bit field estimation schemes 
and establishes the statistical and performance equivalence of these approaches. An important hardware 
implication of this paper is that noisy op-amps (noisy threshold comparators) are adequate for high- 
resolution distributed field reconstruction. This should be be contrasted with the framework in [3] which 
implicitly requires sensors to have the ability to quantize their observations to an arbitrarily high bit 
resolution. A side contribution of this paper is the holistic treatment of the general distributed field- 
reconstruction problem in terms of (i) the field characteristics, (ii) sensor placement characteristics, (iii) 
sensor observation, quantization, and coding constraints with associated sensing hardware implications, 
(iv) transmission and sensing-related network overhead rates, and (v) reconstruction and performance 
criteria. We have attempted to explicitly indicate and keep track of what information is known, available, 
and used where and what is not. 

The randomized scalar quantization model for the sensor comparators not only captures poor sensing 
capabilities but is also an enabling factor in the high-fidelity reconstruction of signals from quantized 
noisy observations. As shown in [4] in an information-theoretic setting, and alluded to in [2], the use 
of identical deterministic scalar-quantization (SQ) in all sensors will result in the MSE performance 
being fundamentally limited by the precision of SQ, irrespective of increasing sensor density, even in 
the absence of sensor observation noised However, our results further clarify that having "diversity" in 
the scalar quantizers, achieved, for example, through the means of an intentional random dither, noisy 
threshold, or other mechanisms, can achieve MSE performance that tends to zero as the density of sensors 
goes to infinity (Section ttll-Al Implications). Randomization enables high-precision signal reconstruction 
because zero-mean positive and negative fluctuations around a signal value can be reliably "averaged 
out" when there are enough independent noisy observations of the signal value. This observation is also 
corroborated by the findings reported in the following related studies [l]-[3], [5]-[8]. 

2 The problem will persist even for identical block vector-quantization (VQ) with identical binning (hashing) operations. 
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The results of this work are also aligned with the information-theoretic, total network rate versus 
MSE scaling results for the CEO problem which was first introduced in [9] and thereafter studied 
extensively in the information theory literature (see [10], [11] and references therein). However, it 
should be noted that information-theoretic rate-distortion studies of this and related distributed field 
reconstruction (multiterminal source coding) problems typically consider stationary ergodic stochastic 
fields with complete knowledge of the field and observation-noise statistics, block-VQ and binning 
operations, and time and space-averaged (as opposed to worst-case) expected distortion criteria. In VQ, 
sensors are allowed to collect long blocks of real-valued field samples (of infinite resolution) from 
multiple field snapshots before a discrete, finite bit-rate VQ operation. The fields are often assumed to 
be spatially constant and independent and identically distributed (iid) across time (frequently Gaussian) 
and the observation noise is often assumed to be additive with a known distribution (frequently Gaussian) 
as in the CEO problem. It should also be noted that the MSE scaling results for the CEO problem in [10] 
are with respect to the total network rate where the number of agents (or sensors) has already been sent 
to infinity while maintaining the total network rate a finite value. Recent information-theoretic results 
for stationary fields under zero observation noise have been developed in [12], [13]. There is also a large 
body of work on centralized oversampled A-D conversion, e.g., see [14] and references therein. Our 
work does not explicitly address physical-layer network data transport issues. In particular, we do not 
consider joint source-channel coding strategies (however see remark before Section IIV-AI ). For certain 
types of joint source-channel coding aspects of this and related problems, we refer the reader to the 
following references [15]-[20]. Networking issues such as sensor scheduling, quality of service, and 
energy efficiency may be found in [21] and references therein. 

The rest of this paper is organized as follows. The main problem description with all the associated 
technical modeling assumptions is presented in SectionJI] The main technical results of this paper are then 
crisply summarized and their implications are discussed in Section |III] Section ITVl describes the proposed 
constructive distributed coding and field reconstruction scheme and the analysis of MSE performance 
which leads to the technical results of Section |nl] For completeness, in Section IIV-AI we also briefly 
discuss sensor deployment issues but this is not the focus of this work. In Section |Vj we discuss the close 
connections between the work in [2], [3], and the present work, and establish the fundamental statistical 
and performance equivalence of the core techniques in these studies. We also discuss how the scenario 
of arbitrary unbounded noise and threshold distributions can be accommodated when the statistics are 
known. We conclude in Section [VT] by summarizing the main findings of this work. The proofs of two 
main results are presented in the appendices. 
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Fig. 1. Block diagram of a distributed field reconstruction sensor-network using randomized 1-bit SQ with block- 
coding. Sensor i quantizes its noisy observations, Ya, . . . , Yir, to the binary values Bn, . . . , Bit. The sensor then generates 
the message Mi € {1, . . . , 2 rT } based on these quantized values. These messages {Mi} are then relayed to the fusion center 
where the field estimates St are produced. 



II. Distributed Field Reconstruction Sensor-network (DFRS) Setup 
A. Field Model 

We consider a sequence of T discrete-time snapshots of a spatio-temporal field[] Each snapshot is 
modeled as a continuous bounded function, 

s t : G -> K : Vx G G, Vt G {1, . . . ,T}, |s t (x)| < a < +oo, 

where G C M. d is a known geographical region of interest in (i-dimensional real space and a is a known 
bound on the maximum field dynamic range. Although the results of this paper hold for any G which 
is bounded and is the closure of its nonempty interior, for simplicity and clarity of exposition, we will 
assume G = [0, l] d , the (i-dimensional unit-hypercube, in the sequel. Distances are measured with respect 



to a nonr 



I, which for this work will be assumed to be the Euclidean 2-norm. Since the fields are 



continuous functions on the compact set G, they are in fact uniformly continuous on G [22]. 

3 If the spatio-temporal field is temporally bandlimited then the field values at intermediate time points can be interpolated 
from the estimates at discrete time snapshots if the temporal sampling rate is (strictly) higher than the temporal Nyquist rate 
of the field. The associated MSE will be no larger than the maximum MSE of the estimates across the discrete-time snapshots 
times a proportionality constant. 

4 More generally, our results can be extended to arbitrary, amplitude-bounded, measurable functions. For such functions the 
pointwise MSE bounds given in Section UlI-AI still hold. The estimates at the points of continuity will have MSE tending to 
as the network size scales. However, the points of discontinuity may have a finite, but non-zero MSE floor. 

5 For asymptotic results in which distance — > 0, any norm on R d would suffice since all norms on any finite-dimensional 
Banach space are equivalent [22, Theorem 23.6, p. 177]. 
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Results on the fidelity of the field reconstruction will be described in terms of the local and global 
moduli of continuity associated with the field: 

Definition 2.1: (Local modulus of continuity) The local modulus of continuity u! t : [0, oo) x G — > [0, oo) 
of the function st(x) at the point x G G is defined as 

oj t (5,x)= sup \s t (x) - s t (x')\. 

{x'£G:\\x-x'\\<8} 

Note that for all x G G, u>t(S,x) is a nondecreasing function of 5 and that it — > as 5 — ► since 
st(x) is continuous at each point x in G. 

Definition 2.2: (Global modulus of continuity) The global modulus of continuity uj t ■ [0, oo) — > [0, oo) 
of the function s t (x) is defined as 

ui t (5) = sup u t (5,x). 

Again note that Ut(5) is a nondecreasing function of 5 and that it — ► as 6 — > since St(x) is 
uniformly continuous over G. 

The global and local moduli of continuity of a spatial field respectively reflect the degree of global and 
local spatial smoothness of the field with smaller values, for a fixed value of 5, corresponding to greater 
smoothness. For example, for a spatially constant field, that is, for all x G G, st(x) = st (a constant), 
we have uit(5) = for all 5 > 0. For d = 1 and fields with a uniformly bounded derivative, that is, for 
all x G G, sup x£G \d(st{x)) / dx\ = A < +oo, u)t(5) < A • 5. More generally, for a Lipschitz-7 spatial 
function (see [1]) st(x), we have ut(8) oc 8 1 . Closed-form analytical expressions of moduli of continuity 
may not be available for arbitrary fields but bounds often are. Sometimes bounds that are tight in the 
limit as 6 — ► are also available. From Definitions 12.11 12.21 and the boundedness of the field dynamic 
range, it also follows that for all 5 > 0, for all x G G, and for all t G {1, . . . , T}, we have 

< cj t (8, x) < Ut(8) <2a < +00. 

B. Sensor Placement 

We assume that we have a dense, noncooperative network of N sensors distributed uniformly over 
a hypercube partitioning of G = [0, l] d . The space G = [0, l] d is uniformly partitioned into L = l d 
(where / is an integer) disjoint, hypercube supercells of side-length {1/1). Each supercell is then further 
uniformly partitioned into M = m d (where m is an integer) hypercube subcells of side-length (l/(lm)), 
giving a total of LM subcells. In our distributed field coding and reconstruction scheme, described 
in Section llV] the field estimate for each snapshot is constant over each supercell and is formed by 
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Fig. 2. Example uniform sensor deployment and cell hierarchy over [0, l] 2 (d = 2). Here, N — 864 sensors are deployed 
over L = 4 2 supercells of side-length (1/4) and M = 3 2 subcells per supercell of side-length (1/(3-4)), resulting in 6 sensors 
per subcell. 

averaging the measurements from a partial set of the sensors determined by the subcells. This field 
reconstruction scheme requires knowledge of the sensor locations only up to supercell (not subcell) 
membership. Therefore, it has some natural robustness against sensor location uncertainty or error. The 
significance of the super and subcells will become clear in the sequel (Sections [III] and ITVl) . 

We assume that the sensor deployment mechanism is able to uniformly distribute the sensors over the 
subcells. We define this uniform sensor deployment condition with: 

Definition 2.3: (Uniform sensor deployment) We say that a sensor deployment method is uniform if 
exactly n = (N/(LM)) sensors are located in each subcell. 

Definition 12 . 3 1 describes ideal sensor deployment conditions and can be achieved by locating the sensors 
over a uniform grid. However, precise control of sensor locations may not be possible in practice. 
Since we are not primarily concerned about the details of deployment, we defer discussion of such 
issues to Section IIV-A1 where we introduce a stochastic deployment model in order to capture the 
uncertainty of realistic deployment mechanisms. In Section IIV-AI we show that this deployment method 
satisfies a relaxed version of Definition 12.31 the asymptotic nearly uniform deployment condition given 
by Definition 14- 1 L which does not significantly change the estimator performance. 

For clarity of presentation, we will assume that the deployment scheme being used satisfies the uniform 
sensor deployment condition given in Definition |2.3i We also assume that each sensor is aware of which 
subcell it is in. Figure [2] illustrates the cell hierarchy and an example sensor deployment for the d = 2 
dimensional case. 
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C. Sensor Observation and Coding Models 

1 ) Sensor Observation Noise: The sensor observations are corrupted by bounded, zero-mean additive 
noise which is independent across sensors, but can be arbitrarily correlated across field snapshots^. 
Let Zn denote the noise affecting the observation of the t th snapshot by the i th sensor, and define 
the Z = {Zit} 1 ^ t=1 (the collection of all of the noise random variables) and Z.; = {Zu}f =1 (the 
collection of all of the noise random variables for a given sensor i). The noise Z has an unknown joint 
cumulative distribution function (cdf) i*z(z) that can be arbitrary within the zero-mean, boundedness 
and independence constraints already stated. The maximum dynamic range of the noise b 6 [0, +00) is 
known. The noisy observation of field snapshot t £ {1, . . . , T} made by sensor i G {1, . . . , N} is given 
by 

Y it = s t (xi) + Z it , 

where Xi is the location of the i th sensor and Z ~ cdf -Fz(z). We use T to denote the set of all joint 
cdfs that are factorizable into N zero-mean joint cdfs on M. T with support within [—b, +b] T , that is, 
-^z(z) = ni^i -^Zi(zi) where Fz £ (zi) is a zero-mean joint cdf (corresponding to the noise random 
variables for sensor i) with support within [— b, +b] T . Note that T captures the feasible set of joint 
noise cdfs for the bounded-amplitude, zero-mean, and independence assumptions. Also note that \Yu\ < 
\s t (xi)\ + \Z it \ < c= (a + b). 

2) Randomized 1-bit SQ with Block Coding: Due to severe precision and reliability limitations, each 
sensor % G {l,...,iV}, has access to only to a vector of unreliable binary quantized samples B, = 
(Bii, . . . , Bit) for processing and coding and not direct access to the real-valued noisy observations 
Yji, ... , YiT- The quantized binary sample B: lt is generated from the corresponding noisy observation Yu 
through a randomized mapping Qu : [— c, c] — ► {0, 1}: for each i G {1, . . . , N} and each t G {1, . . . , T}, 

Bu = Qit(Yit), 

where we assume that the mappings Qu are independent across sensors i, but can be arbitrarily correlated 
across snapshots t. We denote the conditional marginal statistics of the quantized samples by PB u \Y it (y) — 
P(Bit = l\Yu = y). We are specifically interested in cases where PB lt \Y it {v) i s an affine function of y 
since it allows estimates of the fields to be made from the B^s without knowledge of the noise distribution 
(see Appendix Jl). Specifically we consider the conditional distribution 

PB it \Y it (y) = 

6 The measurement snapshot timers of all the participating sensors are assumed to be synchronized. 
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This conditional distribution can be achieved by a quantization method which is based on comparing 
the noisy observation with a random uniformly distributed threshold given by 



B, 



Ql\Y u ) 4 



l(Y it > R 



(2.1) 



where the Ru's are Unif [— c, c] random thresholds which are independent across sensors i, but arbitrarily 
correlated across snapshots t, and l(-) denotes the indicator function: 



1(Y U > Rit) = { 



1 if Y it > R it , 
otherwise. 

This uniform random-threshold 1-bit SQ model partially accounts for some practical scenarios that 
include (i) comparators with a floating threshold voltage, (ii) substantial variation of preset comparator 
thresholds accompanying the mass-manufacture of low-precision sensors, (iii) significant environmental 
fluctuations that affect the precision of the comparator hardware, or generally (iv) unreliable comparators 
with considerable sensing noise and jitter. An alternative justification is that the random thresholds are 
intentionally inserted as a random dither. Scenario (i) can be accommodated by independence across 
snapshots, scenario (ii) can be accommodated by complete correlation (fixed) across snapshots, and 
scenarios (iii) and (iv) can be accommodated by arbitrary correlation across snapshots. 



Sensor i 




► 5 



it 



Fig. 3. Quantizer hardware example. The sensing model described by the Qf t h (') function in J2.lt can be implemented by 
a comparator with a uniformly distributed threshold. These thresholds are independent across sensors, but arbitrarily correlated 
across snapshots, allowing many scenarios to be accommodated. 



Each sensor i utilizes a block encoder to "compress" its vector of T quantized samples Bj to a message 
Mi £ {1,2, ... , 2 rT } before transmitting to the fusion center. The block encoder and message for sensor 
i are given by 

fi : {0, 1} T -» {1, 2, . . . , 2 rT }, Mi = fi(Ba, B iT ), 

where r is the coding rate in bits per sensor per snapshot. For r > 1 compression is trivial since Bj can 
assume no more than 2 T distinct values which can be indexed using T bits. 
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D. Transmission and Field Reconstruction 

In this work, a data fusion center is any point of data aggregation and/or processing in the sensor 
network and can be real or virtual. For instance, sensors can be dynamically organized into clusters with 
different sensors assuming the role of a fusion center at different times [23]. To conform with the existing 
base of digital communication architectures, our problem setup abstracts the underlying transmission 
network of sensors effectively as a network of bit pipes. These bit pipes are capable of reliably delivering 
these N messages (the payloads) and the network addresses of the message origination nodes (the headers) 
to the fusion center. This enables the fusion center to correctly associate the spatial location information 
with the corresponding sensor field-measurement information for reliable field reconstruction. In practice, 
sensor data can be moved to the fusion center through a variety of physical-layer transport mechanisms, 
example, a stationary base-station with directional antenna arrays, a mobile data collector, and passive 
sensor querying mechanisms involving, for instance, laser-beams and modulating mirrors [24]. 

Separating the distributed field reconstruction problem into efficient data acquisition and efficient data 
transport parts through a finite-rate reliable bit-pipe abstraction may be suboptimal [25, p. 449], [15], 
[16]. For instance, in some scenarios multihop communication is not needed and the characteristics of 
the field, the communication channel, and the distortion-metric are "matched" to one another. In such 
a scenario, uncoded "analog" transmission can offer huge performance gains if the synchronization of 
sensor transmissions can be orchestrated at the physical layer to achieve beamforming gains and the 
network channel state information is available to the transmitting sensors [15]. Certain aspects of this 
analog transmission can be incorporated within our field reconstruction framework and is briefly discussed 
in the remark just before Section IIV-AI 

For our reconstruction scheme, described in Section ITVl the fusion center only needs to be able to 
spatially localize the origin of each message to within the supercell resolution. This can be achieved by 
having each sensor append a log(LM) bits long label to its message. This results in a total sensor-location 
rate-overhead of = (N/T)log(LM) bits per snapshot on the network information transport costs. 
This overhead will be negligible if T 3> N log(LM). If the underlying sequence of fields are spatially 
constant, then, the sensor location information is not needed at the fusion center (see Corollary 13.11 and 
Section |IV]>. 

The fusion center forms the estimates of the T fields based on the sensor messages using the recon- 
struction functions 

9t :Gx{l,2,...,2 rT } N ^[-a,a], V* G {1, . . . , T}. 
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The estimate of field t at point x G G is denoted by 

S t (x)=g t (x,M 1 ,...,M N ). 

Definition 2.4: (Rate-r DFRS) A rate-r DFRS based on randomized 1-bit SQ with block coding is 
defined by the set of rate-r encoder functions {fi{-)}f = i and the set of reconstruction functions {gt(-)}f = \- 
Figure Q] depicts a rate-r DFRS using randomized 1-bit SQ with block coding. 
1) Performance Criterion: 

Definition 2.5: (Pointwise MSE) The pointwise MSE of the estimate of field t at location x G G, for 
a given rate-r DFRS and a specific noise joint cdf Fz(z) G JF, is given by 

A(x;F z ) =E[(S t (x)-s t (x)) 2 ]. 

Since we are interested in schemes that will work for any noise cdf in T, we consider the worst-case 
D t (x;Fz) over all possible Fz G T. We also consider the maximization over all fields and all locations 
in G since we want to reconstruct every point of every field with high fidelity. 

Definition 2.6: (Worst-case MSE) The worst-case MSE D is given by 

D = max sup sup D t (x\Fz). 
te{l,...,T} X £GF Z £F 

Our objective is to understand the scaling behavior of MSE with N, T, and r. The next section 
summarizes our partial results in this direction. 

III. Main Results 

A. Achievable MSE Performance 

Our first result gives an upper bound on the MSE achievable through a constructive DFRS based on 
randomized 1-bit SQ with block coding for rate r = 1/M, where M is the number of subcells per 
supercell. The actual scheme will be described in Section [IV] The MSE analysis appears within the proof 
of the theorem detailed in Appendix J] This achievable MSE upper bound can be made to decrease to 
zero as sensor-density goes to infinity (see (13.11 )) without knowledge of the local or global smoothness 
properties of the sequence of fields. Furthermore, this scheme is universal in the sense that it does not 
assume knowledge of i*z(z) beyond membership to T . 

Theorem 3.1: (Achievable MSE performance: Randomized 1-bit SQ and r = 1/M) There exists a 
rate-r = 1/M DFRS based on randomized 1-bit SQ with block coding (e.g., the scheme of Section Hvb 
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such that for all x € G, t 6 {1, . . . , T}, and F z (z) G J 7 , 

\ / LMc 2 ' 



. > 2 / \ fLMc 2 \ 

D t (x;F z ) < uU—xj + ^-^j 




N 



Proof: See Section [TV] and Appendix U 
Note that Theorem 13.11 holds for arbitrary fields. The modulus of continuity terms in the local (first) and 
global (second) upper bounds of Theorem 13. II are due to the bias of the field estimates and the ^ L %° 2 ^ 
term is due to the variance of the field estimates (see (14.21 ) in Section iTVl ). From Theorem 13.11 and the 
properties of moduli of continuity (see Section Ill-Al l, it follows that for the coding and reconstruction 
scheme of Section [TV] as N — ► oo, the estimate S t (x) uniformly converges, in a mean square sense, to 
st(x) for all ieG, provided that 

(i) I — 1 — ► oo, and (ii) L — ► oo. (3.1) 



It also follows that the worst-case MSE scaling behavior (see Definition 12.61 ) is bounded by 
and that D — > as N and L scale as in (13.1b . 

Implications: These results allow us to make the per sensor per snapshot bit rate r, worst-case MSE 
D, and sensor message ID overheads (given by (N/T) log (LM) bits) simultaneously smaller than any 
arbitrarily small desired values r* , D* , e > 0, respectively. First, we can choose a sufficiently large number 
of subcells per supercell M* such that the rate r = 1/M* < r*. Then we can choose a sufficiently large 
number of sensors N* and number of supercells L* such that the bound on D given by (13.21 ) is made 
less than D*. Note that both N* and M* can be further increased while keeping the ratio M*/N* 
fixed without changing the bound on D. This corresponds to increasing the total number of sensors N, 
decreasing the per sensor rate r = 1/M, but keeping the total network per snapshot rate Nr = N/M 
and distortion D fixed. Finally, we can look at a sufficiently large number of snapshots T* such that 
network message overheads (N*/T*) log(L*M*) < e. 

In the constructive coding and field reconstruction scheme of Section [IVJ the field estimates are 
piecewise constant over the supercells. The estimate in each supercell is formed from only n = (N/ (LM)) 
of the Mn = (N/L) quantized observed values coming from the sensors located in that supercell. Since 
only (1/M) of the total available quantized observed values for each snapshot are used, the transmission 
rate of (1/M) is achievable by indexing only the necessary values (see Section [IV] for details). As the 
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number of supercells L increases, the piecewise constant estimate becomes finer and the bias is decreased. 
Also, as the number of sensors per supercell is increased, more observations are used thus decreasing 
the variance of the estimate. 

Since the variance term in the upper bound of Theorem 13. II can decrease no faster than 0(l/N), 
the decay of the global MSE upper bound, in the proposed constructive scheme, can be no faster than 

0(1/N). However, the decay rate of hls ff is hindered by the fact that L simultaneously needs to approach 

-2 ( Vd ' 



infinity for the bias term uif y~TT[j t0 decay to 0. When oJt(-) is not identically zero, a bias-variance 
tradeoff exists and the appropriate relative growth rate for L with N that minimizes the decay rate of 
the global MSE upper bound of Theorem 13.11 is determined by the following condition 



For certain classes of signals for which the global modulus of continuity has a closed form, the optimum 
growth rate can be explicitly determined. For instance, if d = 1 and u)t{$) = A • <5 (Lipschitz-1 fields), 
L opt (N) = 0(iV 1 / 3 ) for which MSE = 0(N~ 2 / 3 ). 

Corollary 3.1: (Achievable MSE performance: Randomized 1-bit SQ, r = l/M, and constant fields) 
If for all x G G and all t G {1, ... ,T}, we have st(x) = s t , or equivalently, for all 6 > and all 
t G {1, . . . , T}, uJt(S) = 0, then the result given by ( 13.21 ) reduces to 

- V JV 

where we can set L = 1 to minimize the bound. 

Only L = 1 supercell is needed for an accurate piecewise constant reconstruction of a constant field. 
Furthermore, all snapshot-estimates given by the scheme from Section [IV] are unbiased in this case. Also, 
the spatial locations of sensors are irrelevant: the MSE behavior is governed purely by the number of 
sensors N regardless of how they are distributed over the subcells. The N sensors must still be uniformly 
assigned to one of M groups (for the purpose of transmission coordination to achieve the compression 
factor of l/M), however these groups need not have any geographical significance. 

The MSE results given by Theorem 13.11 show that the field snapshot estimates converge uniformly in 
MSE and upper bound the MSE decay rate. Every point of every estimate, in fact, converges almost 
surely to the true value. We also state a central limit theorem (CLT) result regarding the estimation error. 

Theorem 3.2: (Almost— sure convergence of field estimates) There exists a rate-r = l/M DFRS based 
on randomized 1-bit SQ with block coding (described in Section ITVl ) such that for all x G G, t G 
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{1,...,T}, and i*z(z) G JF, 



as JV and L scale as given in (13 - 1 b - 

Proof: See Section [IV] and Appendix HH I 
Corollary 3.2: ( Central limit theorem for estimation errors) For the rate r = 1/M DFRS of Section HVl 
the normalized error at point x G G for the estimate of field snapshot t G {1, . . . , T}, given by 

St (a;) - s t (x) 
\Jvax[S t (x) ~ s t (x)} 

is asymptotically zero-mean, unit-variance, normal as N and L scale as given in (13.11 ). for any Fz(z) G J 7 . 
Proof: The proof is similar to and follows directly from the proof of Theorem 2.4 in [2]. ■ 

B. Order— Optimal Minimax MSE for Constant Fields 

The minimax reconstruction MSE over the class of constant fields is given by 

inf sup D, 

where the infimum is taken over all possible estimators and the supremum is taken over all noise 
distributions and fields from the class of constant fields which is denoted by S. The achievable MSE result 
given by Corollary 13.11 establishes an upper bound on the minimax reconstruction MSE. Theorem 13.31 
lower bounds the minimax reconstruction MSE for any rate r DFRS that produces unbiased estimates 
for the case of spatially constant fields. 

Theorem 3.3: (Lower bound on MSE: Unbiased estimators for constant fields) For a sequence of 
spatially constant fields and any DFRS which produces unbiased field estimates, there exists a joint cdf 
Fz G T such that for noise distributed according to Fz the MSE is lower bounded by 

n(S t - st) 2 } > (jfj , for all t G {1, . . . , T}, 

where Ct is finite, non-zero, and does not depend on N. Therefore, 

'C t 



inf sup D > max 
0? t }Sf F z er, St es te{i,...,T}\N, 
Proof: Since {st} — ► {Yn} — > {Bit} —>■ {Mi} forms a Markov chain, the estimates based on the 

sensor messages {Mi, . . . , M/v} cannot have a lower MSE than estimates based on the noisy observations 

{Yn}. Let Fz G T be any well-behaved, non-trivial, joint cdf such that the Zn are iid and the conditional 

probabilities of Yu given the fields satisfy the regularity conditions necessary for the Cramer-Rao bound 
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[26] to be applied. By the Cramer-Rao bound, the MSE of each field estimate based on {Yu} is lower 
bounded by ^ where Ct is finite, non-zero, and depends on Fz, but does not depend on N. Note that 
the bound also applies to general randomized 1-bit SQ functions Qu(-) including those based on uniform 
random thresholds Qf t h (-) (see (|2~TT )). ■ 
Combining the results of Corollary 13.11 and Theorem 13.31 establishes that the order-optimal minimax 
MSE for spatially constant fields is @(1/N) and that the scheme of SectionlTVl achieves this order optimal 
performance. 

IV. Proposed Constructive Distributed Coding and Field Reconstruction Scheme 

In this section we present the proposed DFRS scheme that was alluded to in Section |III] In this scheme, 
sensors create the quantized binary samples {Bu} from their observations {lit} through comparisons with 
the random thresholds {Ru}, as described in ( 12. lb of Section Hl-C.21 The field estimates are piecewise 
constant over the supercells, where the estimate formed in each supercell is a function of only (N/ (LM)) 
of the (N/L) quantized observed values coming from the sensors located in that supercell. This allows 
fractional transmission rates of r = 1/M through a simple time-sharing based compression method. Note 
that there can be uncertainty in the sensor locations, within a degree given by the size of a supercell, at 
the fusion center, since it is only necessary for the fusion center to know which supercell each sensor is 
located in. 

Each sensor i, instead of transmitting all of its T bits (the vector of its binary quantized observations 
Bj = (Bn, . . . , Bit)), transmits only rT = T/M of them and the remaining observations are dropped. Or 
alternatively, the sensor may sleep and not record the remaining measurements. The two-level hierarchy of 
supercells and subcells described in Section ITl-B I is used in order to properly determine which bits sensors 
should drop or keep. Within each supercell, each sensor i from subcell k E {1, . . . , M} communicates 
only every M th bit (offset by k), that is {B iyk+M i} l ^ /M) ~ l . These rT bits can be uniquely represented 
by the message Mj E {1, . . . , 2 rT } and losslessly communicated to the fusion center. Thus for snapshot 
t E {1, . . . , T}, only the bits from senors in the [((t — 1) mod M) + l] th subcell of each supercell are 
communicated to the fusion center. The set of all sensor indices corresponding to the n = (N/(LM)) 
sensors belonging to the [((t — 1) mod M) + l] th subcell of supercell j will be denoted by I(j,t). In 
other words, this set of indices corresponds to all those sensors which are located in supercell j and are 
responsible for recording and encoding a bit in the t-th snapshot. 

For notational simplicity, the reconstruction function St(x) = gt(x, Mi, . . . , Mn) will be described 
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directly in terms of the available binary quantized observations and not the encoded messages {Mi} 
which are information equivalent. The reconstruction function St(x) is piecewise constant and is described 
as follows. The field st(x) is reconstructed as a constant over each supercell j. The constant estimate is 
given by 



Stj = 2c 



- E B * 

iei(j,t) 



(4.1) 



which is the simple average (shifted and scaled into [— c, +c]) of the available quantized binary obser- 
vations of snapshot t from sensors located in supercell j. The overall piecewise-constant estimate for 
st(x) can be then described as 

S t (x) = g t (x,M 1 ,...,M N ) 

L 

± J2S tj l(x e Xj), (4.2) 

J'=l 

where Xj C [0, l] d is the set of points within the j th hypercube supercell and l(x G Xj) given by 



l(x € Xj) 



1 if x € Xj, 



otherwise, 

is the indicator function of the set Xj. Other more sophisticated reconstruction algorithms are possible. 
For instance, instead of the simple average used in ( 14.11 ), one may use a weighted average with convex 
weights, and for the overall reconstruction in (14.21 ). one may use a piecewise-linear or other higher-order 
interpolation algorithms such as those based on cubic B-splines (see [2]). The resulting MSE will be 
of the same order. We use the former (simple average, piecewise-constant) reconstruction because its 
description and analysis is more compact. Appendix U proves that the MSE of this constructive coding 
and reconstruction scheme is upper bounded by the result described in Theorem 13.11 
Remark: As discussed at the beginning of Section III-DI physical-layer network data transport issues 
are not the focus of this work. However, if synchronized analog beamforming from the sensors within 
each subcell to the fusion center can be achieved, then the summation in the reconstruction given by 
equation (14.11 ) can be realized directly in the analog physical layer, by "adding signals in the air", using 
a simple binary pulse amplitude modulation signaling scheme at each sensor. The additional estimation 
error variance due to the receiver amplifier noise at the fusion center will decrease as 1/n by scaling the 
sampled received waveform by 1/n as in (14- 1 b - This will lead to corresponding achievable power versus 
distortion tradeoffs (as opposed to bits versus MSE or sensors versus MSE) which can be quantified. 

7 The set of binary quantized observations for snapshot t which are available at the fusion center is given by {Bit}^ i£U L t ^y 
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A. Sensor Deployment Considerations 

The conditions given by Definition 12.31 correspond to exactly (N/(LM)) sensors uniformly falling 
into each subcell with probability one for all N, L, and M (ignoring integer affects). In the regime 
of perfect sensor placement control (or when placement error is negligible compared to the width of 
the cells), this condition is trivially realized by locating the sensors on a uniform grid. However, such 
precise sensor placement control might not be achievable in practice. In order to address this issue we 
introduce a stochastic sensor deployment model, one that captures an extreme case of (uncontrollable) 
sensor placement uncertainty where each sensor is deployed according to a uniform distribution over 
[0, l] d . We also relax the uniform sensor placement to an asymptotic nearly uniform sensor deployment 
given by Definition 14.11 

Definition 4.1: (Asymptotic nearly uniform sensor deployment) We say that a sensor deployment 
method is asymptotically nearly uniform with parameters (7, e, JV*) if at least 772 = j(N/(LM)) are 
located in each subcell with probability at least 1 — e for all N > N*, where 7 G (0, 1] represents the 
inverse of the "over-provisioning" factor for the number of sensors needed to be deployed. 

This relaxation does not significantly impact our results since we are interested in the asymptotic results 
(as L and N scale as in (13.11 )) where the 7 and e parameters of Definition [4j] can be made negligible. Our 
stochastic deployment scheme satisfies this asymptotic nearly uniform condition given in Definition 14.11 
and also almost surely satisfies the uniform deployment condition given by Definition 12.31 for N — ► 00. 

Consider the scenario where N sensors are deployed iid and uniformly over G = [0, l] d . The corre- 
sponding indices of the subcells (the total LM subcells can be indexed by an integer from 1 to LM) 
that the N sensors fall is denoted by the random sequence J = (Ji, . . . , Jjv). Here, Jj ~ iid U, where 
U = (1/(LM), . . . ,1/(LM)) is the uniform probability mass function over LM discrete values. We 
examine the iV-type (empirical distribution) Pj N ^ of J in order to examine the level of uniformity in 
the sensor deployment. An empirical distribution equal to U corresponds to the uniform deployment 
condition of Definition 12.31 being met. Since the indices are also distributed iid according to U, by the 
strong law of large numbers, the empirical distribution converges almost surely to U as ./V — ► 00, and 
thus almost surely the sensors will be deployed uniformly over the subcells according to Definition 12.31 
as N — ► 00. 

Also, a result from large deviations theory bounds the probability that the empirical distribution will 
not be in a ^-neighborhood of the uniform distribution. This corresponds to the event where there exists 
a subcell that has more than ^fct^ or fewer than N ^7, S ^ sensors located within it. Let & N be the set of 
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*»B 



R ~ Unif[-c, +c] 




(a) 



(b) 



Fig. 4. The Qf t h (') function in d2.lt and the Q(-) function of [3] suggest markedly different hardware implementations. 

The former naturally suggests (a), where the binary quantized value is produced by a simple comparison to a random threshold 
X. The latter suggests (b), where an arbitrarily-precise ADC circuitry probabilistically selects an arbitrary bit of the observed 
value. Interestingly, these two implementations produce statistically equivalent quantized outputs B given identical inputs Y. 



iV-dimensional probability distributions, U 5 = [(1 - 8)/{LM), (1 + 5)/{LM)] N be the ^-neighborhood 
around the uniform probability mass function U, D{-\\-) denote the Kullback-Leibler distance [25], and 

P* = arg min D(P\\U) 

p<zff>N\JJS 

denote the probability distribution not in II s that is closest to U in Kullback-Leibler distance. It should 
be noted that D(P*\\U) > for all 6 > 0. Then by Sanov's theorem [25, p. 292], 

P(Pf G & N \ U 5 ) < (N + l)LM 2 -ND(P'\\U) 

_ 2 -N(D(P* ||t/)- Milog(iV+l))_ (4 _ 3) 

This inequality bounds the probability that not all subcells have at least N ^/^ sensors within them. This 
shows that as long as the number of sensors deployed iV grows faster than the actual number of sensors 
needed LM, then the near uniform deployment condition will be eventually met. Thus, this determines 
how many total sensors N* > LM need to be deployed in order to satisfy the asymptotic nearly uniform 
sampling condition of Definition 14.11 for a given desired e and for 7 = 1 — S. 

V. Discussion of Related One-bit Estimation Problems and Extensions 

This section discusses the connections between the methods and results in [2], [3], and the present work. 
It is shown that the apparently different randomized 1-bit field estimation schemes in these studies are in 
fact statistically and MSE performance equivalent. We also address how, in the scenario of known noise 
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statistics, unbounded noise distributions and arbitrary threshold distributions can be accommodated. The 
general framework of the present work integrates the desirable field sensing and reconstruction properties 
and insights of the earlier studies and provides a unified view of the problem that simultaneously considers 
unreliable binary quantization, unknown arbitrary noise distributions, multiple snapshots of a temporally 
and spatially varying field, and communication rate issues. Since the work in both [2] and [3] deal with 
the reconstruction of only a single snapshot (T = 1), we will drop the snapshot indices t in our discussion 
to aid comparison. 

A. One— Bit Randomized— Dithering 

The problem setup of [2] may be viewed as the reconstruction of a single snapshot (T = 1) of 
a bounded, one-dimensional field (d = 1) from noiseless samples (Zi = 0) at uniformly spaced 
(deterministic) sampling locations (xj = i/N). In [2] the noiseless observations are binary quantized using 
random thresholds E4S that have a known general distribution which satisfies certain technical conditions 
described in [2, Section II. A]. These technical conditions include the uniform distribution (considered in 
this paper) as a special case. An important conceptual difference is that in [2] the sensor quantization 
noise is viewed as a randomized dither signal which is intentionally added to the observations and that 
the dither cdf is known (it need not be uniform). The reconstruction explicitly exploits the knowledge of 
the dither statistics. Specifically, the noiseless observation Y{, at sensor i, and the corresponding quantized 
binary sample Bi become 

Y{ — s(xj), 

Bi = QW^agniYi+Xi), 

where Xi is iid dithering noise with a known distribution px{-) which satisfies certain technical assump- 
tions as given in [2, Section II. A]. Note that taking the sign of the sum of the observation and random 
dither Xi is equivalent to comparing with the threshold — Xj. Thus the quantization function Q(-) of 
[2] is equivalent to a comparator with a random threshold that is distributed according to px(— x). 
The quantization function Qf t h (-) in (12.11 ) can be viewed as a special case of this where px{— %) is the 
uniform distribution over [— c, c]. The constructive scheme of Section ITVl and the analysis of this work 
shows that Qf t h {-) can in fact be used even on noisy field observations with an additive noise of unknown 
distribution. 

8 The sign function maps to { — 1, +1} whereas a threshold comparator maps to {0,1}. However, the replacement of the —1 
symbol with the symbol is unimportant from an estimation viewpoint. 
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B. Parameter Estimation with One-Bit Messages 

The parameter estimation problem in [3] corresponds to the special case of a spatially constant field 
(s(xi) = s for all % where the index t is omitted since T = 1) which is addressed by Corollary 13.11 We 
summarize below the key features of the randomized binary quantizer proposed in [3] and show that the 
randomized 1-bit SQ function Q(-) of [3] is statistically and MSE performance-wise equivalent to the 
uniform random threshold quantizer Qj t h {-) in (12. II ). However, the Q(-) function of [3] implicitly requires 
sensors of arbitrarily high precision, a property that is undesirable for sensor hardware implementations. 

In [3], each sensor i first shifts and scales it observation Y into interval [0, 1] creating the value 
Y = Next, each sensor i generates an auxiliary random variable aj, which is iid across sensors 

and is geometrically distributed over the set of all positive integers: F(ai = j) = 2~- ? for all j £ 
{1, 2,3,..., oo}. The final quantized binary sample Bi reported by sensor i is given by the otf bit in the 
binary expansion of Yf 

Bi = Q(Yi) =B(Yi,ai), 

oo 

where Y i = Y,B(Y i ,j)2~K (5.1) 
i=i 

Here, B(Yi,j) denotes the j th bit of Yj. For example, if Y = 0.375, then the first four bits of its binary 
expansion are given by B{%1) = 0, B(Y h 2) = 1, B(Y h 3) = 1, and -B(Y,4) = 0. If a, = 3, then 
sensor i reports Bi = 1. This method for generating binary sensor messages requires sensors to have 
the operational ability to quantize an observed real number (the normalized values Y) to an arbitrarily 
high bit-resolution. Note that the binary values Bi are iid across all sensors and that its expected value 
is given by 

E[Bi] = Ey,\E Bi [Bi\Yi\] 



v, 

oo 



%[Y] 

ir. \ Y i + c 
E 



2c 

E[s + Zi] + c fs + c 



(5.2) 

V 2c < ■ (5 " 3) 
In sharp contrast to the Q(-) function described above, which requires sensors to have the operational 
ability to resolve any arbitrary bit in the binary expansion of their normalized observations, Qf t h (-) 
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requires only a noisy comparator. Despite the markedly different operational implementations of Q(-) and 
Qf t h (-) (see (15.11 ). (12- lb . and Fig. [4] which depicts hardware implementations) they are in fact statistically 



Pb\ s {') functions where PB it \Y u (') i s tne conditional expectation of Bi given Yi = yi and Pb\ s (') is tne 
unconditional expectation of Bi parameterized by the underlying field value s(xj) = s. These expectations 
have been evaluated in (I5.2I ). (15.31 ). (11.11 ) and (II.2I ). and we see that for both functions 



This statistical equivalence allows the two quantization functions Q(-) and Qf t h (-) to be interchanged 
without affecting the estimation performance. 

C. Extensions to Unbounded Noise and Arbitrary Thresholds with Known Distributions 

In this work, we have made assumptions of zero-mean, amplitude-bounded, additive noise, which can 
have an arbitrary, unknown distribution, and uniformly distributed quantization thresholds. The results of 
this work can be extended to deal with noise that is not amplitude-bounded (i.e. Gaussian, Laplacian, 
etc.) and for thresholds with arbitrary distributions, however certain technical conditions must be met and 
the distributions for both the noise and the threshold must be known. A possible approach is to combine 
the noise and threshold random variables into an overall random dither variable Xu = Zu + Ru and 
applying the method and results used in [2] (see Section IV-AI) . The main MSE result of Theorem 13. II will 
still hold, however with new constants multiplying each term in the bound. The method is essentially 
the same as in Section |IVl however the value of the field estimate at every point is passed through a 
non-linear function, instead of a simple scaling and shifting, given by 



with fi(s) = 1 — 2Px(— s) where Px(-) is the cdf of the dither random variable Xa. The technical 
requirement for this extension is that Px{-) is absolutely continuous with a probability density function 
px(-) on (— oo,oo) which is continuous and positive over an open interval (—a', a') containing [—a, a]. 
This ensures that Px{ m ) is strictly monotonically increasing over the signal dynamic range and that /U _1 (-) 
exists (see [2]). 



identical: the binary quantized values Bi generated by the two schemes have the same Pb u \y u (') an d 
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If the sensor observation noise is unbounded (but the field st(x) is still bounded), has zero-mean, and 
has an unknown probability density function (pdf) whose tails decay to zero, it is still possible to make a 
weak statement about the achievable MSE as the number of sensors go to infinity. With unbounded noise, 
the sensor observations may exceed any finite dynamic range [— c, c] of the one-bit sensors. This leads to 
the appearance of additional error bias terms (see equation (11.21 ) and (11.41 ) in Appendix U) which depend 
on the unknown signal value st(x) to be estimated and the dynamic range limit c. It can be shown that 
these terms go to zero as c — > oo. Thus one can assert that for a sufficiently large dynamic range limit 
c and a corresponding sufficiently large number of sensors N(c), the MSE can be made smaller than 
some desired tolerance. The actual scaling behavior will now also depend on the tail decay rate of the 
unknown pdf of the observation noise. 

VI. Concluding Remarks 

The results of this work show that for the distributed field reconstruction problem, for every point of 
continuity of every field snapshot, it is possible to drive the MSE to zero with increasing sensor density 
while ensuring that the per-sensor bitrate and sensing-related network overhead rate simultaneously go 
to zero. This can be achieved with noisy threshold (one-bit) comparators with the minimal knowledge 
of signal and noise dynamic ranges provided that the noise samples are zero-mean, and independent 
across sensors and the underlying field, and the sensor placement and sampling schedule satisfy a certain 
uniformity property. The rate of decay of MSE with increasing sensor density is related to the the local and 
global smoothness characteristics of the underlying fields and is order-optimal for the class of spatially 
constant fields. This work has further clarified the utility of randomization for signal acquisition to combat 
limited sensing precision and unknown noise statistics in a distributed sensor network context. This work 
has also attempted to systematically account for sensor placement and hardware issues in addition to the 
typical constraints encountered in related studies. 

Acknowledgment 

The authors would like to thank Nan Ma and Manqi Zhao from the ECE department of Boston 
University for helpful comments and suggestions during different stages of this work. This material 
is based upon work supported by the US National Science Foundation (NSF) under award (CAREER) 
CCF-0546598, (CAREER) ECS-0449194, CCF-0430983, and CNS-0435353, and ONR (PECASE) grant 
no. N00014-02-100362. Any opinions, findings, and conclusions or recommendations expressed in this 
material are those of the authors and do not necessarily reflect the views of the NSF and ONR. 



22 



Appendix I 
Proof of Theorem 13. II 

First, note that the expected value of the binary message Ba is given by 







E[i(y it > R it )] 

E Yh [E Rh [1{Y u > R it )\Y lt ] 
E YH [F(R lt <Y lt \Y it )] 



E 



2c 



(«) 



E[s f (xj) + Z it ] +c 
2c 

gt(Xj) + c 

2c 



(1.1) 



(1.2) 



which is the value of the field st(-) at location Xj shifted and normalized to the interval [0,1]. Note that 
the key steps are step (z) where we used the fact that Ru is uniformly distributed over [— c, c] and step 
(ii) where we used the fact that Za has zero mean. It should be noted that the final result (11.21 ) holds 
for any Fz(z) & T. 

Using (11.21 ) we can bound the bias and the variance of the estimator St(x). The bound on the MSE 
follows from the bounds on these values since, for any estimator of a non-random parameter, we have 



MSE (S t (x)) = bias 2 (s t (x)) + var (s t { 



(1.3) 



Let j 6 { 1 , . . . , L} denote the index of the supercell that x falls in. We bound the magnitude of bias 
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of the estimate St(x) in the following way 

bias (St(x) 



S t (x) - s t (a;)j 



iei(j,t) 



-c - s t (x) 



2c 



- E 



iel(j,t) 
-c - s t (x) 



2c 



-c - s t (x) 



gt(Xj) + c 

2c 



ie/(i,i) 



< - E l s t(a!i) - st(x)| 

ie/(j,t) 

< - 2^ OJ t (\\x - Xi\\,X) 



n 

iel(j,t) 

(Hi) i 

< - E ^ 



Vd 



ii 



(iv) _ 
< U t 



i£l(j,t) 

(Vd 
\Vl 

( Vd 
L 



L 



(1.4) 



where (i) follows from (11.21 ). (ii) and (iv) follow from Definitions 12. II and I2.2[ and (Hi) follows because 
the local modulus of continuity is a nondecreasing function of its first argument for each fixed value of 
its second argument and since any sensor in the supercell containing x is within distance -^L of x (the 
length of the diagonal of a supercell). 
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The variance of the estimate is bounded by 



vax[St(x)] = var 



< 



- J2 B * 

n 

iel(j,t) 

^pj MBit] 
' iei(j,t) 

4c 2 \ /n\ _ ( LMc 2 



(1.5) 



(1.6) 



\AJ V N 

where we used standard properties of variance and the fact that {B it } are independent to obtain dl.51 ), 
and we used the fact the variance of a Bernoulli{0, 1} random variable is bounded by (1/4) and that 
n = (N/(LM)) to obtain (lL6l 

Combining these bounds for the bias and variance given in dl.4| ) and dl.61 ) of the estimator and using 
the identity in (Pi we get the desired bound on the MSE for all x G G, t G {1, . . . , T}, and F z (z) G T. 



Appendix II 
Proof of Theorem |3.2| 



First, we note that 



S t (x) ^ s t (x) = S t (x) - s t (x) 
Thus, we proceed with the triangle equality to bound 



0. 



(II.l) 



St(x) - s t (x) 



< 



+ 



S t (x)-E S t (x 



E 



S t (x) - s t {x) 



(11.2) 



In the proof of Theorem U given in Appendix U we have shown that the second term of dll.21 ), which is 
the absolute value of the estimator bias, is bounded by dl.4b which shows that 



E 



S t (x) - s t (x) 







(II.3) 



as N and L scale as in (13- 1 b - 

Letting j denote the supercell that x falls in, the first term of dll.2b can be rewritten as 



St(x)-E S t (x 



2c 



- ^ B lt -E[B lt ] 



i£l(j,t) 
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Recall that the cardinality of I(j,t) is n = (N/(LM)). Since the Ba random variables are independent 
across sensors and their fourth central moments are uniformly bounded by 1 (since they are binary {0,1} 
random variables), the strong law of large numbers [27, pp. 206-207] can be applied to obtain 



- V Bit - E[J3t; 



n * — ' 

i£l(j,t) 



as N — ► oo (since n = (N/(LM))) and thus the first term of (III.2 



S t (x)-E S t (x) 



0. (II.4) 



Combining (III.3I ) and (III.4I ) into (III. lb and (111.21 ) finishes the proof. ■ 
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